Toposław - A Lexicographic Framework for Multi-word Units

نویسندگان

  • Malgorzata Marciniak
  • Agata Savary
  • Piotr Sikora
  • Marcin Wolinski
چکیده

The paper presents a tool for the creation of an electronic dictionary of multi-word proper names. Toposław uses graphs for the representation of inflectional and pragmatic variants of names. It cooperates with Morfeusz, a morphological analyser and generator for Polish words, and Multiflex, a cross-language morpho-syntactic generator of multi-word units. Our goal was to create a userfriendly tool that makes a lexicographic work easy and efficient. In the paper we describe facilities for graph creation, management and debugging. The presented tool was applied to create a dictionary of Warsaw urban proper names.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On multiword lexical units and their role in maritime dictionaries

Multi-word lexical units are a typical feature of specialized dictionaries, in particular monolingual and bilingual maritime dictionaries. The paper studies the concept of the multi-word lexical unit and considers the similarities and differences of their selection and presentation in monolingual and bilingual maritime dictionaries. The work analyses such issues as the classification of multi-w...

متن کامل

Lexicographic goal programming approach for portfolio optimization

This paper will investigate the optimum portfolio for an investor, taking into account 5 criteria. The mean variance model of portfolio optimization that was introduced by Markowitz includes two objective functions; these two criteria, risk and return do not encompass all of the information about investment; information like annual dividends, S&P star ranking and return in later years which is ...

متن کامل

Using lexicographic parametric programming for identifying efficient hyperpalnes in DEA

This paper investigates a procedure for identifying all efficient hyperplanes of production possibility set (PPS). This procedure is based on a method which recommended by Pekka J. Korhonen[8]. He offered using of lexicographic parametric programming method for recognizing all efficient units in data envelopment analysis (DEA). In this paper we can find efficient hyperplanes, via using the para...

متن کامل

Obtaining a Unique Solution for the Cross Efficiency by Using the Lexicographic method

Cross efficiency is a method with the idea of peer evaluation instead of self-evaluation, and is used for evaluation and ranking Decision Making Units (DMUs) in Data Envelopment Analysis (DEA). Unlike most existing DEA ranking models which can only rank a subset of DMUs, for example non-efficient or extreme efficient DMUs, cross efficiency can rank all DMUs, even non-extreme ones. However, sinc...

متن کامل

Connected Component Based Word Spotting on Persian Handwritten image documents

Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009